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METHOD OF PRODUCING HUMAN GROWTH FACTORS FROM 
WHOLE PLANTS OR PLANT CELL CULTURES 



FIELD OF THE INVENTION 

The present invention relates generally to a method for producing human 
10 growth factors from whole plants or plant cell culture. More specifically, the 

invention relates to producing a human growth factor from a plant cell encoded to 
produce the human growth factor with a length of at least 200 amino acids from 
transgenic plant cells. 

15 BACKGROUND OF THE INVENTION 

Growth factors and monoclonal antibodies (Mabs) are diverse yet highly 
specialized types of proteins having research and commercial applications in areas 
of therapeutics and diagnostics. 

20 Therapeutic uses of human epidermal growth factor (hEGF) include treatment 

of soft tissue wounds (U.S. 5,218,093, 1993), specifically including skin and eye 
injuries as well as corneal and stomach ulcers (Frost and Sullivan 1996, 1994). In 
addition, several hEGF-bearing fusion constructs have been considered and/or 
tested, including mitotoxins for treatment of restenosis (Frost and Sullivan, 1994) 

25 and radioconjugates for a variety of anti-neoplastic therapies (Grieg et al., 1988). 

Current production techniques for these proteins such as hybridoma and other 
types of mammalian cell culture methods (Kohler and Miisten, 1975) are generally 
slow, labor intensive, and consequently, expensive. In addition, current 
production techniques are difficult to validate due to the pathogenic and oncogenic 

30 potential of cultivated mammalian tissue. 

Multimers of from 2 to 7 EGF units each having 53 amino acid residues 
have been produced from bacterial hosts, eg E. coli, Streptomyces and Bacillus, 
fungal hosts, eg Saccharomyces, Pichia and Aspergillus, insect cell host, and 
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mammalian cell hosts, eg CHO cells and COS cells. (U.S. Patent No. 5218093, 
1993). hEGF production in Staphylococcus aureus (U.S. Patent No. 5004686, 
1991) is by a fusion construct encoding hEGF linked to a protein. Synthesis 
methods using transgenic bacterial strains have problems such as faulty antibody 
5 gene expression, protein folding difficulties, inability to glycosylate proteins, and 
relegation of foreign peptides to insoluble material accumulated in inclusion 
bodies. 

Transgenic plants can be used for the production of high value, medicinally 
important proteins, for example, production of Mabs (Hiatt et al., 1989; During et 

10 al., 1990; Benvenuto et ai. 1991, Firek et al. 1993, Gao et al. 1993), human 
growth hormone (Kay et al. 1986) and human serum albumin (Sijmons et al. 
1990). Transformed cells synthesize, secrete, and accumulate functional 
antibodies including single (Benvenuto et al. 1991) and double (During et al. 1990, 
Hiatt et al. 1991) domain immunoglobulins. However, it is noted that none of 

15 these authors investigated production of any human growth factor from transgenic 
plants. 

Plant cell culture media are well-defined and inexpensive compared to 
mammalian cell culture media. Further, plant cell products, unlike mammalian- 
derived protein formulations, are generally assumed as neither pathogenic nor 

20 oncogenic to humans (Crawford, 1995). Also, when compared to similar 

production in transgenic bacterial strains (Attaai and Shuler 1987), plant tissue 
culture methods showed greater stability of foreign gene expression, even without 
use of selection pressure (Gao et al. 1991). One author, Higo et al. (1993) 
produced a human growth factor, specifically hEGF in transgenic tobacco with 

25 cDNA fragment size of 180 bp. Unsatisfactory foreign peptide levels of 20 to 60 
pg/mg (ppb) total soluble leaf protein were obtained. This is despite the fact that 
plant progeny appeared to produce high levels of hEGF mRNA. Exact reasons for 
low observed levels of hEGF production are unclear. However, no signal peptide 
was encoded upstream of hEGF cDNA which could cause the foreign protein to be 
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relegated to the cytosol. Within this cell fraction, hEGF suffers proteolytic attack, 
especially considering the relatively small size (53 amino acids) or the peptide. 

Although advantages have been observed for deriving proteins including EGF 
from plants, no transgenic plant cell culture process has been commercially 
5 developed for production of human growth factor. The lack of commercial 
exploitation of plant derived proteins is due in part to existing technological 
huixiles as observed by Higo et al. In addition, Ma et al., 1995 reported Mab 
titers of up to 500 /ig/g (ppm) fresh weight of plant material (or 300 mg/L on a 
cell culture basis) whereas comparable mammalian cell processes are reported to 

10 attain levels of 1-2 g/L and higher (Rosenberg, personal communication, 1995). 
Implementation of alternative production systems to mammalian and bacterial 
culture, such as plant cellular techniques, has been further limited by non- 
technological factors, such as industry and regulatory acceptance (Simonsen and 
McGrogan, 1994) because of the investment made in developing and validating the 

15 more established non-plant methods. 

Accordingly, there is a continuing need for plant based production of human 
growth factors. 
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20 
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Mostov K, Lehner T. 1995. Generation and Assembly of Secretory 
Antibodies in Plants. Science 268:716-719. 

5 19. Simonsen CC, McGrogan M. 1994. The molecular biology of production 
cell lines. Biologicals 22:85-94. 

SUMMARY OF THE INVENTION 

10 Despite the hurdles in technology development and commercialization, 

economic analysis indicates that regulatory costs associated with plant cell culture 
may reduce by as much as $70,000 per batch as compared to analogous 
mammalian cell processes (Crawford, 1995). In addition, direct production costs 
for whole plant processes at equal protein production rates appear to be two to 

15 four orders-of-magnitude lower than comparable mammalian cell processes 
(Agracetus 1995). Additionally, as plant cell titers increase, this type of 
production becomes even more capital cost-effective. 

It is, therefore, an object of the present invention to provide whole plant and 
plant cell culture derived human growth factors at higher overall concentrations 

20 and production rates, comparable to mammalian host cell systems. 

It is a further object of the present invention to synthesize specific human 
growth factors. 

It is another object of the present invention to increase production rates and 
concentrations by increasing protein stability through the use of fusion constructs. 
25 It is a further object of the present invention to use Nicotian? tabacum 

(tobacco) and Solarium tuberosum (potato) whole plants and highly synchronous 
suspensions. 

According to the present invention, the production of human growth factors 
is achieved in whole plants or plant cell culture wherein the human growth factor 
30 is produced with a length of at least 200 amino acids. For epidermal growth 
factor this would comprise at least a tetramer of EGF units. 
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Modifying chimeric cDNA and subcloning into a plant expression vector are 
done using standard molecular cloning procedures (Ausubel et al. 1992) and 
splicing PCR techniques (Marks et al. 1992). 

Effectiveness or production of the translation process has been increased 
5 according to the present invention by (1) cloning of pre-pro-EGF cDNA of 
approximately 4.5 kb into both whole plants and cell culture to increase overall 
titers of active hEGF, (2) synthesizing cDNA and transforming plants and cell 
culture for production of an oligomeric polypeptide consisting of repeated hEGF 
domains, and (3) increasing the overall size of the gene to be expressed with a 

10 fusion construct encoding hEGF linked to a protein that is efficiently produced in 
plant systems. As needed, synthetic cDNA includes plant-specific proteolytic 
cleavage sites between EGF repeats to facilitate correct processing in planta. 
Appropriate proteolytic cleavage sites upstream and downstream of hEGF are 
added if needed to obtain final product. 

15 The subject matter of the present invention is particularly pointed out and 

distinctly claimed in the concluding portion of this specification. However, both 
the organization and method of operation, together with further advantages and 
objects thereof, may best be understood by reference to the following description 
taken in connection with accompanying drawings wherein like reference characters 

20 refer to like elements. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 provides the size of EGF precursor (pre-pro-EGF) relative to 
25 correctly processed EGF. 

FIG. 2 depicts schematically the construction of pZD203, a vector used to 
modify the restriction sites on pre-pro-EGF to develop cDNA suitable for cloning 
into the plant expression vector pGA643. 

FIG. 3 depicts schematically the construction of pZD204, the plant 
30 expression vector carrying pre-pro-EGF. 
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FIG. 4 shows EGF levels seen in individual calli resulting from positive 
transformation and antibiotic selection. EGF concentrations were determined 
using enzyme-linked immunosorbent assay and are based on a 30 KD protein size. 

5 DESCRIPTION OF THE PREFERRED EMBODIMENT(s) 

The present invention is a method for production of human growth factors 
using whole plants as well as plant cell suspensions transformed with appropriately 
constructed vector plasmids, wherein the human growth factor is produced with a 
10 length of at least 200 amino acids. More specifically, the method of the present 
invention is stable expression of human growth factors of interest as direct 
therapeutics, targeted delivery systems and research reagents. Human growth 
factors produced include human epidermal growth factor (hEGF), transforming 
growth factor (TGF), vascular endothelial growth factor (VEGF), platelet-derived 
growth factor (PDGF), fibroblast growth factor (FGF), tumor necrosis factor 
(TNF), heparin-binding epidermal growth factor (HBEGF), insulin-like growth 
factor (ILGF), platelet-derived endothelial cell growth factor (PDECGF), platelet- 
derived angiogenesis factor (PDAF), and bone-and-cartilage inducing growth 
factor (BCIF). 

Any plant from the plant kingdom may be utilized. Specific types of plants 
that are amenable to the transformation steps listed herein include, but are not 
limited to monocotyledonous, dicotyledonous, and tuberous plants. Preferred 
species include but are not limited to Nicotiana tabacum (tobacco), Solarium 
tuberosum (potato), Glycine max (soybean), and Zea mays (corn). 

The method of the present invention, a method of producing human growth 
factors from plant cells, has the steps of: 

(a) obtaining a positive transformant of the plant cells, the positive 
transformant carrying genetic material encoding the production of a human growth 
factor with a length of at least 200 amino acids; 

(b) cultivating the positive transformant; and 
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(c) obtaining the human growth factors. 
The step of obtaining may be as simple as purchasing or more complex 
actual making by well known methods, for example direct particle bombardment 
as described in Gene Transfer by Particle Bombardment, Klein TM, Knowlton S, 
5 Arentzen R, Plant Tissue Culture Manual, Dl, pp 1-12, 1991, Kluwer Academic 
Publishers, or by Agrobaterhim mediated transformation as described in Hoekema 
et al. 1985 (Hoekema KM, Hirsch PR, Hooykaaf PJJ, Schliperoort RA, 1985, 
Nononcogenic Plant Vectors for Use in the Agrobacterium Binary System, Plant 
Molecular Biology, Vol. 5, 85-89), and further described herein. 
10 The step of cultivating involves either whole plant cultivating or tissue 

cultivating by any of well known cultivating methods. 

The step of obtaining is by well known separation purification steps, for 
example ultrafiltration, affinity chromatography, and/or electrophoresis. 

An Agrobacterium mediated transformation method of the present invention 
15 has the steps of: 

(a) modifying chimeric cDNA encoding a specific growth factor for 
subcloning into a plant expression vector 

(b) subcloning the chimeric cDNA into the plant expression vector; 

(c) transferring the plant expression vector containing transgenic plant 
20 cells to an agrobacterium; 

(d) co-cultivating a portion of the transgenic plant cells (suspension 
culture or leaf disks) with the agrobacterium; 

(e) selecting positive transformants from the co-cultivated culture on 
an antibiotic selective media; 

25 (0 permitting growth of the transgenic plant cells in whole plants or 

suspensions; and 

(g) extracting a liquid containing the human growth factor; 
wherein the improvement comprises: 

said human growth factor having a length of at least 200 amino acids. 
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Modifying chimeric cDNA and subcloning into a plant expression vector are 
done using standard molecular cloning procedures (Ausubel et al. 1992) and 
splicing PCR techniques (Marks et al. 1992). More specifically, modifying 
chimeric cDNA, has the steps of: 
5 (a) adding a transcription promoter to the upstream or 5' end of 

the chimeric cDNA; and 

(b) adding a transcription terminator to the downstream or 3* end 
of the chimeric cDNA. The transcription promoter and the transcription 
terminator are regulatory elements. 

10 Further, an additional regulatory element encoding a signal peptide may be added 
between the transcription promoter and the 5' end of the chimeric cDNA in order 
to relegate the product human growth factor to a specific cellular organelle. In 
addition, other regulatory elements may be added either between the promoter and 
the additional regulatory element encoding the signal peptide or at the 3' end of 

15 the chimeric cDNA to obtain greater mRNA stability between transcription and 
translation events. 

In either whole plants or cell cultures, to enhance expression of the chimeric 
gene (hEGF), the present invention further includes manipulation of a 35S 
promoter by duplication of the upstream region (-343 to -90 bp) of the CaMV 35S 

20 promoter to increase transcription activity, as well as use of TSC29 and TSC40 
promoters. These promoters and their transcription activity have been reported by 
Gao et al. 1994, and Dai et al. 1995. 

In whole plants, transcription promoters may include the upstream enhancer 
(nucleotides -343 to -90 relative to the transcription start site) of the CaMV 35S 

25 promoter (Benfey et al. 1989) or the chlorophyll a/b binding protein (cabl) 

promoter (Ha and An 1988). Use of these types of regulatory elements confers 
human growth factor production characteristics into traditionally non-salable 
portions of crop plants, such as the leafy tops of potatoes. Use of potato tops, for 
example, under post-harvest conditions, results in overexpression and production 



WO 98/21348 



PCT7US97/20603 



- 10- 

of human growth factor in non-salable plant portions towards the end of the 
harvesting season, without affecting crop quality. 

Transferring the plant expression vector into the agrobacterium is completed 
using the freeze-thaw method (An 1987). For monocotyledonous species, super- 
5 binary vectors, such as pTOK233 and pSB131, are used to achieve high 

transformation frequency (Ishida et al. 1996). Remaining cocultivation, selection, 
growth, and extraction steps (d through g) have been described by Magnusen et al. 
(1996), and are well known in the art of plant molecular biology. 

Many human growth factors possess relatively short lengths of between 50 

10 and 100 amino acids. For example, hEGF has a length of 53 amino acids. 
Accordingly, obtaining a larger construct of at least 200 amino acids requires 
either (1) cloning the larger precursor cDNA, (2) synthesizing a concatemer 
consisting of multiple gene copies encoding the growth factor, or (3) increasing the 
overall size of a gene to be expressed using a fusion construct encoding a growth 

15 factor linked to a protein that is efficiently produced in plant systems. 

An example of obtaining a larger precursor to increase the overall protein 
size is the cDNA encoding pre-pro-EGF. This particular gene, at approximately 
4.5 kb, encodes a 1207 amino acid protein that, in vivo, is proteolytically cleaved 
to yield 53 amino acid EGF. In plant systems, this larger protein will provide 

20 additional stability against proteolytic degradation. 

Synthesizing the cDNA concatemer is preferably done by ligating multiple 
gene copies using peptide linkers to obtain a processed protein length of at least 
200 amino acids. The multiple gene copies are preferably an oligomeric 
polypeptide having of repeated growth factor cDNA domains. Peptide linkers may 

25 be used that are (1) proteolytically cleaved in planta, (2) proteolytically cleaved in 
a separate enzymatic treatment step, or (3) resistant to proteolytic cleavage. 
Peptide linkers that are proteolytically cleaved by serine proteases in planta 
preferably possess the amino acid sequence Arg-Asn. This sequence already exists 
when EGF is concaternerized since the C-terminal amino acid is arginine and the 

30 N-terminal amino acid is asparagine. To achieve in planta cleavage, the processed 
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protein is targeted either to the cell cytosol (no signal peptide) or vacuole 
(phytohemagglutinin signal peptide [Chrispeels et al.1991]). To achieve 
proteolytic cleavage in a separate enzymatic treatment step, the same amino acid 
sequence is preferably used (Arg-Asn) and the growth factor concatemer is either 
5 targeted to the chloroplast (pea photosystem II signal peptide) or secreted (PR-II 
signal peptide) to limit proteolytic degradation. To achieve resistance to 
proteolytic cleavage, linkers would preferably possess the amino acid sequence 
Arg-Pro. This sequence is resistant to serine proteases. Specifically for EGF, 
linkage would preferably be achieved by synthesizing cDNA encoding a single 

10 proline unit between growth factor monomers cDNA. 

Increasing the overall size of a gene may be done by ligating EGF with 
cDNA encoding a protective protein to protect from proteolytic cleavage, thereby 
forming a fusion construct. Protective proteins include but are not limited to 
streptococcal protein G or -galactosidase, that have both been shown to inhibit 

15 proteolysis when attached to the C-terminus of other foreign proteins (Hellebust et 
al. 1989). Gene size could also be increased by ligating EGF with cDNA 
encoding another protective protein of commercial interest that processes well in 
plant-based systems. Protective proteins further include human serum albumin 
(Sijmons et al. 1990) and phytase (Verwoerd et al. 1995). 

20 At least one genetic regulatory element may be included in the cDNA 

encoding the transcription of specific growth factors. Regulatory elements include 
transcription promoters or enhancers that increase the frequency of transcription 
events, leader sequences that increase the stability of mRNA prior to translation, 
and signal peptides that target proteins to specific organelles for posttranslational 

25 modifications and accumulation. Examples of transcription enhancers include but 
are not limited to the octapine synthase enhancer, a 16 bp palindrome 
. (ACGTAAGCGCTTACGT) (Ellis et al. 1987) and the B-domain of the 
cauliflower mosaic virus 35S promoter (Kay et al. 1987). An example of a leader 
sequence includes but is not limited to alfalfa mosaic virus RNA4 leader sequence 

30 (Jobling and Gehrke 1987). Examples of signal peptides include but are not 
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limited to the tobacco PR-S signal peptide (Cornelissen et al. 1986) and the 
phytohemagglutinin signal peptide (Hunt and Chrispeels 1991). 

Example 1 

5 The bacteriophage XEGF116 (ATCC No. 59956) containing the gene 

encoding the full length polypeptide of human kidney pre-pro-EGF was obtained 
from ATCC. Pro-EGF (FIG. 1) is the 1207 amino acid precursor in which hEGF 
is flanked by polypeptide segments of 907 and 184 residues at its NH 2 - and 
COOH-termini, respectively (Bell et al., 1986). The remainder of the 4.8 kb pre- 

10 pro-EGF gene encodes native signal peptides at both the NH 2 - and COOH- termini 
of pro-EGF. The polypeptide contains a transmembrane (TM) binding region that 
facilitates proper cleavage in the endoplasmic reticulum. 

The full length of cDNA was excised with Sma I, Hind HI, and Eco RI 
restriction enzymes, as shown on FIG. 2, producing two separate fragments. 

15 These were sequentially ligated into compatible Sma I and Eco RI sites in 

pBluescript- creating the 7.5 kb plasmid pZD203. After proper orientation was 
confirmed, pre-pro-EGF cDNA was further excised with Xba I and Cla I 
restriction enzymes and ligated into compatible sites located between the CaMV 
35S promoter and T 7 transcription terminator of binary vector pGA643, forming 

20 the 16 kb plasmid pZD204 (FIG. 3). This plasmid was directly transferred into 
Agrobacterium tumefaciens LBA4404 using the freeze-thaw method (An 1987). 
The transferred plasmid was introduced into tobacco whole plants (by leaf disks) 
and calli (by suspension culture) by co-cultivation with the Agrobacterium thereby 
producing transfonnants. Over 200 specific samples of transformants were taken 

25 from the co-cultivation and separately placed on kanamycin selective media. The 
co-cultivated transformants that grew were positive transformants. The positive 
transformants were screened under kanamycin selection pressure and preliminary 
ELISA results indicated the presence of hEGF in tobacco calli. Accumulation 
levels of hEGF in select transgenic calli are shown on a ng/g fresh weight basis in 

30 FIG, 4. The bars in FIG. 4 represent a random sample of the specific samples of 
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transformants. The highest level of accumulation at approximately 400 ng/(g fresh 
weight cells) (ppb) corresponds to a concentration of 4.1 ng/(mg total soluble 
protein) (ppm) (based on a measured total soluble protein level of approximately 
98 mg/(g fresh weight cells)). The 4.1 ng/(mg total soluble protein) (ppm) 

5 corresponds to 4100 pg/(mg total soluble protein) (ppb) which is almost two 
orders-of-magnitude greater than the result of 60 pg/(mg total soluble protein) 
(ppb) reported by Higo et al. (1993). 

Further ELISA and Northern blot analyses were used to detect high levels of 
foreign protein production and mRNA transcription, respectively. Western blot 

10 analysis, completed to determine protein size, showed that specific EGF bearing 
constructs of 30 KD were produced. This size corresponds to approximately 250 
amino acids. 

Closure 

15 While a preferred embodiment of the present invention has been shown and 

described, it will be apparent to those skilled in the art that many changes and 
modifications may be made without departing from the invention in its broader 
aspects. The appended claims are therefore intended to cover all such changes and 
modifications as fall within the true spirit and scope of the invention. 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: Brian S. Hook r, et al 

(ii) TITLE OF INVENTION: Method of Producing Human Growth 
Factors From Whole Plants or Plant Cell Cultures 

(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Paul W. Zimmerman 

(B) STREET: P.O. Box 999 

(C) CITY: Richland 

(D) STATE: WA 

(E) COUNTRY: USA 

(F) ZIP: 99352 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3 1/2 Magnetic Disk 

(B) COMPUTER: IBM compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: WORD97 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/747,246 

(B) FILING DATE: 11-12-96 

(C) CLASSIFICATION: unknown 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: N/A 

(B) FILING DATE: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Paul W. Zimmerman 

(B) REGISTRATION NUMBER : 34,761 

(C) REFERENCE/DOCKET NUMBER: E-1519 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 509-375-2981 

(B) TELEFAX: 509-375-2592 

(C) TELEX: 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4481bp 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: double strands 

( D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: Sense orientation of complementary DNA 

for pro-EGF 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 5'-AGT GAC TCA GTC GAG ... TTC TCA CTC 

GTC-3 end 

(v) FRAGMENT TYPE: 4.5kb Smal/Hindlll double strands DNA 

fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: kidney 

(B) STRAIN: human 

(C) INDIVIDUAL ISOLATE: GI Belle 

(D) DEVELOPMENTAL STAGE: adult 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE : 

(A) LIBRARY: fetal human liver library 

(B) CLONE: lambda CH4A; lambda EMBL4 ; lambda GM1416 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME/KEY: human epithelial growth factor cDNA 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: cross-hybridization 

with mouse cDNA 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 

(K) RELEVANT RESIDUES IN SEQ ID NO: 

FROM (position) TO (position) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



CCCGGGCCAT GCTCCAGCAA AATCAAGCTG TTTTCTTTTG AAAGTTCAAA CTCATCAAGA TT 62 

ATG CTG CTC ACT CTT ATC ATT CTG TTG CCA GTA GTT TCA AAA TTT AGT TTT GTT 116 

AGT CTC TCA GCA CCG CAG CAC TGG AGC TGT CCT GAA GGT ACT CTC GCA GGA AAT 170 

GGG AAT TCT ACT TGT GTG GGT CCT GCA CCC TTC TTA ATT TTC TCC CAT GGA AAT 224 

AGT ATC TTT AGG ATT GAC ACA GAA GGA ACC AAT TAT GAG CAA TTG GTG GTG GAT 278 

GCT GGT GTC TCA GTG ATC ATG GAT TTT CAT TAT AAT GAG AAA AGA ATC TAT TGG 332 

GTG GAT TTA GAA AGA CAA CTT TTG CAA AGA GTT TTT CTG AAT GGG TCA AGG CAA 38 6 

GAG AGA GTA TGT AAT ATA GAG AAA AAT GTT TCT GGA ATG GCA ATA AAT TGG ATA 44 0 

AAT GAA GAA GTT ATT TGG TCA AAT CAA CAG GAA GGA ATC ATT ACA GTA ACA GAT 4 94 

ATG AAA GGA AAT AAT TCC CAC ATT CTT TTA AGT GCT TTA AAA TAT CCT GCA AAT 54 8 

GTA GCA GTT GAT CCA GTA GAA AGG TTT ATA TTT TGG TCT TCA GAG GTG GCT GGA 602 

AGC CTT TAT AGA GCA GAT CTC GAT GGT GTG GGA GTG AAG GCT CTG TTG GAG ACA 656 
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TCA GAG AAA ATA ACA GCT GTG TCA TTG GAT GTG CTT GAT AAG CGG CTG TTT TGG 710 
ATT CAG TAC AAC AGA GAA GGA AGC AAT TCT CTT ATT TGC TCC TGT GAT TAT GAT 764 
GGA GGT TCT GTC CAC ATT AGT AAA CAT CCA ACA CAG CAT AAT TTG TTT GCA ATG 818 
TCC CTT TTT GGT GAC CGT ATC TTC TAT TCA ACA TGG AAA ATG AAG ACA ATT TGG 872 
ATA GCC AAC AAA CAC ACT GGA AAG GAC ATG GTT AGA ATT AAC CTC CAT TCA TCA 926 
TTT GTA CCA CTT GGT GAA CTG AAA GTA GTG CAT CCA CTT GCA CAA CCC AAG GCA 980 
GAA GAT GAC ACT TGG GAG CCT GAG CAG AAA CTT TGC AAA TTG AGG AAA GGA AAC 1034 
TGC AGC AGC ACT GTG TGT GGG CAA GAC CTC CAG TCA CAC TTG TGC ATG TGT GCA 1088 
GAG GGA TAC GCC CTA AGT CGA GAC CGG AAG TAC TGT GAA GAT GTT AAT GAA TGT 1142 
GCT TTT TGG AAT CAT GGC TGT ACT CTT GGG TGT AAA AAC ACC CCT GGA TCC TAT 1196 
TAC TGC ACG TGC CCT GTA GGA TTT GTT CTG CTT CCT GAT GGG AAA CGA TGT CAT 1250 
CAA CTT GTT TCC TGT CCA CGC AAT GTG TCT GAA TGC AGC CAT GAC TGT GTT CTG 1304 
ACA TCA GAA GGT CCC TTA TGT TTC TGT CCT GAA GGC TCA GTG CTT GAG AGA GAT 1358 
GGG AAA ACA TGT AGC GGT TGT TCC TCA CCC GAT AAT GGT GGA TGT AGC CAG CTC 1412 
TGC GTT CCT CTT AGC CCA GTA TCC TGG GAA TGT GAT TGC TTT CCT GGG TAT GAC 1466 
CTA CAA CTG GAT GAA AAA AGC TGT GCA GCT TCA GGA CCA CAA CCA TTT TTG CTG 1520 
TTT GCC AAT TCT CAA GAT ATT CGA CAC ATG CAT TTT GAT GGA ACA GAC TAT GGA 1574 
ACT CTG CTC AGC CAG CAG ATG GGA ATG GTT TAT GCC CTA GAT CAT GAC CCT GTG 1628 
GAA AAT AAG ATA TAC TTT GCC CAT ACA GCC CTG AAG TGG ATA GAG AGA GCT AAT 1682 
ATG GAT GGT TCC CAG CGA GAA AGG CTT ATT GAG GAA GGA GTA GAT GTG CCA GAA 1736 
GGT CTT GCT GTG GAC TGG ATT GGC CGT AGA TTC TAT TGG ACA GAC AGA GGG AAA 1790 
TCT CTG ATT GGA AGG AGT GAT TTA AAT GGG AAA CGT TCC AAA ATA ATC ACT AAG 1844 
GAG AAC ATC TCT CAA CCA CGA GGA ATT GCT GTT CAT CCA ATG GCC AAG AGA TTA 1898 
TTC TGG ACT GAT ACA GGG ATT AAT CCA CGA ATT GAA AGT TCT TCC CTC CAA GGC 1952 
CTT GGC CGT CTG GTT ATA GCC AGC TCT GAT CTA ATC TGG CCC AGT GGA ATA ACG 2006 
ATT GAC TTC TTA ACT GAC AAG TTG TAC TGG TGC GAT GCC AAG CAG TCT GTG ATT 2060 
GAA ATG GCC AAT CTG GAT GGT TCA AAA CGC CGA AGA CTT ACC CAG AAT GAT GTA 2114 
GGT CAC CCA TTT GCT GTA GCA GTG TTT GAG GAT TAT GTG TGG TTC TCA GAT TGG 2168 
GCT ATG CCA TCA GTA ATA AGA GTA AAC AAG AGG ACT GGC AAA GAT AGA GTA CGT 2222 
CTC CAA GGC AGC ATG CTG AAG CCC TCA TCA CTG GTT GTG GTT CAT CCA TTG GCA 2276 
AAA CCA GGA GCA GAT CCC TGC TTA TAT CAA AAC GGA GGC TGT GAA CAT ATT TGC 2330 
AAA AAG AGG CTT GGA ACT GCT TGG TGT TCG TGT CGT GAA GGT TTT ATG AAA GCC 2384 
TCA GAT GGG AAA ACG TGT CTG GCT CTG GAT GGT CAT CAG CTG TTG GCA GGT GGT 2438 
GAA GTT GAT CTA AAG AAC CAA GTA ACA CCA TTG GAC ATC TTG TCC AAG ACT AGA 24 92 
GTG TCA GAA GAT AAC ATT ACA GAA TCT CAA CAC ATG CTA GTG GCT GAA ATC ATG 254 6 
GTG TCA GAT CAA GAT GAC TGT GCT CCT GTG GGA TGC AGC ATG TAT GCT CGG TGT 2 600 
ATT TCA GAG GGA GAG GAT GCC ACA TGT CAG TGT TTG AAA GGA TTT GCT GGG GAT 2654 
GGA AAA CTA TGT TCT GAT ATA GAT GAA TGT GAG ATG GGT GTC CCA GTG TGC CCC 2708 
CCT GCC TCC TCC AAG TGC ATC AAC ACC GAA GGT GGT TAT GTC TGC CGG TGC TCA 2762 
GAA GGC TAC CAA GGA GAT GGG ATT CAC TGT CTT GAT ATT GAT GAG TGC CAA CTG 2816 
GGG GTG CAC AGC TGT GGA GAG AAT GCC AGC TGC ACA AAT ACA GAG GGA GGC TAT 2870 
ACC TGC ATG TGT GCT GGA CGC CTG TCT GAA CCA GGA AAT AGT GAC TCT GAA TGT 2924 
CCC CTG TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT GTG TGC ATG TAT ATT GAA 2978 
GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC TAC ATC GGG GAG CGA TGT 3032 
CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC CTG ATT TGC CCT GAC TCT ACT 3086 
CCA CCC CCT CAC CTC AGG GAA GAT GAC CAC CAC TAT TCC GTA AGA CAC GCT GGC 3140 
CAC GGG CAG CAG CAG AAG GTC ATC GTG GTG GCT GTC TGC GTG GTG GTG CTT GTC 3194 
ATG CTG CTC CTC CTG AGC CTG TGG GGG GCC CAC TAC TAC AGG ACT CAG AAG CTG 3248 
CTA TCG AAA AAC CCA AAG AAT CCT TAT GAG GAG TCG AGC AGA GAT GTG AGG AGT 3302 
CGC AGG CCT GCT GAC ACT GAG GAT GGG ATG TCC TCT TGC CCT CAA CCT TGG TTT 3356 
GTG GTT ATA AAA GAA CAC CAA GAC CTC AAG AAT GGG GGT CAA CCA GTG GCT GGT 3410 
GAG GAT GGC CAG GCA GCA GAT GGG TCA ATG CAA CCA ACT TCA TGG AGG CAG GAG 34 64 
CCC CAG TTA TGT GGA ATG GGC ACA GAG CAA GGC TGC TGG ATT CCA GTA TCC AGT 3518 
GAT AAG GGC TCC TGT CCC CAG GTA ATG GAG CGA AGC TTT CAT ATG CCC TCC TAT 3572 
GGG ACA CAG ACC CTT GAA GGG GGT GTC GAG AAG CCC CAT TCT CTC CTA TCA GCT 3626 
AAC CCA TTA TGG CAA CAA AGG GCC CTG GAC CCA CCA CAC CAA ATG GAG CTG ACT 3680 
CAG TGA 3686 



AAACTGGAAT TAAAAGGAAA GTCAAGAAGA ATGAACTATG TCG ATG CAC A GTATCTTTTC 374 6 

TTTCAAAAGT AGAGCAAAAC TATAGGTTTT GGTTCCACAA TCTCTACGAC TAATCACCTA 3806 

CTCAATGCCT GGAGACAGAT ACGTAGTTGT GCTTTTGTTT GCTCTTTTAA GCAGTCTCAC 3866 

TGCAGTCTTA TTTCCAAGTA AGAGTACTGG GAGAATCACT AGGTAACTTA TTAGAAACCC 3926 

AAATTGGGAC AACAGTGCTT TGTAAATTGT GTTGTCTTCA GCAGTCAATA CAAATAGATT 398 6 

TTTGTTTTTG TTGTTCCTGC AGCCCCAGAA GAAATTAGGG GTTAAAGCAG ACAGTCACAC 404 6 

TGGTTTGGTC AGTTACAAAG TAATTTCTTT GATCTGGACA GAACATTTAT ATCAGTTTCA 4106 

TGAAATGATT GGAATATTAC AAT AC CGT T A AGATACAGTG TAGGCATTTA ACTCCTCATT 4166 

GGCGTGGTCC ATGCTGATGA TTTTGCCAAA ATGAGTTGTG ATGAATCAAT GAAAAATGTA 4 226 
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ATTTAGAAAC TGATTTCTTC AGAATTAGAT GGCCTTATTT TTTAAAATAT TTGAATGAAA 4 28 6 

ACATTTTATT TTTAAAATAT TACACAGGAG GCCTTCGGAG TTTCTTAGTC ATTACTGTCC 4 34 6 

TTTTCCCCTA CAGAATTTTC CCTCTTGGTG TGATTGCACA GAATTTGTAT GTATTTTCAG 4 406 

TTACAAGATT GTAAGTAAAT TGCCTGATTT GTTTTCATTA TAGACAACGA TGAATTTCTT 4 4 66 

CTAATTATGA ATTC 4480 
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(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783bp 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: double strands 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: sense orientation of five 

copies of mature EGF concatemers 

(iii) HYPOTHETICAL: 

<iv) ANTI-SENSE: 5'-CGC GTC AAG GGT ... TCT CAG TGA TAA-3 

end 

(v) FRAGMENT TYPE: 4.5kb Smal/Hindlll double strands DNA 

fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: kidney 

(B) STRAIN: human 

(C) INDIVIDUAL ISOLATE: Z.Dai, et al. 

(D) DEVELOPMENTAL STAGE: adult 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: fetal human liver library 

(B) CLONE: lambda CH4A; lambdaEMBL4 ; lambda GM1416 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/ SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME /KEY : Concatemer of mature EGF fragment 

without linker 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: PCR cloning 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 

(K) RELEVANT RESIDUES IN SEQ ID NO: 

FROM (position) TO (position) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



AAT 


AGT 


GAC 


TCT 


GAA 


TGT 


CCC 


CTG 


TCC 


CAC 


GAT 


GGG 


TAC 


TGC 


CTC 


CAT 


GAT 


GGT 


54 


GTG 


TGC 


ATG 


TAT 


ATT 


GAA 


GCA 


TTG 


GAC 


AAG 


TAT 


GCA 


TGC 


AAC 


TGT 


GTT 


GTT 


GGC 


108 


TAC 


ATC 


GGG 


GAG 


CGA 


TGT 


CAG 


TAC 


CGA 


GAC 


CTG 


AAG 


TGG 


TGG 


GAA 


CTG 


CGC 


AAT 


162 


AGT 


GAC 


TCT 


GAA 


TGT 


CCC 


CTG 


TCC 


CAC 


GAT 


GGG 


TAC 


TGC 


CTC 


CAT 


GAT 


GGT 


GTG 


216 


TGC 


ATG 


TAT 


ATT 


GAA 


GCA 


TTG 


GAC 


AAG 


TAT 


GCA 


TGC 


AAC 


TGT 


GTT 


GTT 


GGC 


TAC 


270 


ATC 


GGG 


GAG 


CGA 


TGT 


CAG 


TAC 


CGA 


GAC 


CTG 


AAG 


TGG 


TGG 


GAA 


CTG 


CGC 


AAT 


AGT 


324 


GAC 


TCT 


GAA 


TGT 


CCC 


CTG 


TCC 


CAC 


GAT 


GGG 


TAC 


TGC 


CTC 


CAT 


GAT 


GGT 


GTG 


TGC 


378 


ATG 


TAT 


ATT 


GAA 


GCA 


TTG 


GAC 


AAG 


TAT 


GCA 


TGC 


AAC 


TGT 


GTT 


GTT 


GGC 


TAC 


ATC 


432 


GGG 


GAG 


CGA 


TGT 


CAG 


TAC 


CGA 


GAC 


CTG 


AAG 


TGG 


TGG 


GAA 


CTG 


CGC 


AAT 


AGT 


GAC 


486 
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TCT GAA TGT CCC CTG TCC 
TAT ATT GAA GCA TTG GAG 
GAG CGA TGT CAG TAC CGA 
GAA TGT CCC CTG TCC CAC 
ATT GAA GCA TTG GAC AAG 
CGA TGT CAG TAC CGA GAC 
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CAC GAT GGG TAC TGC CTC CAT 
AAG TAT GCA TGC AAC TGT GTT 
GAC CTG AAG TGG TGG GAA CTG 
GAT GGG TAC TGC CTC CAT GAT 
TAT GCA TGC AAC TGT GTT GTT 
CTG AAG TGG TGG GAA CTG CGC 



GAT GGT GTG TGC ATG 54 0 
GTT GGC TAC ATC GGG 594 
CGC AAT AGT GAC TCT 64 8 
GGT GTG TGC ATG TAT 702 
GGC TAC ATC GGG GAG 756 

795 
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(4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891bp 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: double strands 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: sense orientation concateraer of 

mature EGF fragments with linkers 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 5 1 -GTC CAG AGC ... CAG TGA TAA-3 end 

(v) FRAGMENT TYPE: 5-copies of 159bp concatemer mature EGF 

linked with linkers 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: kidney 

(B) STRAIN: human 

(C) INDIVIDUAL ISOLATE: Z. Dai, et al 

(D) DEVELOPMENTAL STAGE: adult 

(E) HAPLOTYPE: 
<F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE: 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME /SEGMENT : 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME/KEY: concatemer of mature EGF linked with linkers 

(B) LOCATION: 

(C) IDENTIFICATION METHOD : PCR cloning 

(D) OTHER INFORMATION: Cleavage sites at 142-165, 307-331, 

465-489, 631-655. 



PUBLICATION INFORMATION: 


(A) 


AUTHORS : 


(B) 


TITLE: 


(C) 


JOURNAL: 


(D) 


VOLUME: 


(E) 


ISSUE: 


(F) 


PAGES : 


(G) 


DATE: 


(H) 


DOCUMENT NUMBER: 


(I) 


FILING DATE: 


(J) 


PUBLICATION DATE: 


(K) 


RELEVANT RESIDUES IN SEQ ID NO: 



FROM (position) TO (position) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



AAT 


AGT 


GAC 


TCT 


GAA 


TGT 


CCC 


CTG 


TCC 


CAC 


GAT 


GGG 


TAC 


TGC 


CTC 


CAT 


GAT 


GGT 


54 


GTG 


TGC 


ATG 


TAT 


ATT 


GAA 


GCA 


TTG 


GAC 


AAG 


TAT 


GCA 


TGC 


AAC 


TGT 


GTT 


GTT 


GGC 


108 


TAC 


ATC 


GGG 


GAG 


CGA 


TGT 


CAG 


TAC 


CGA 


GAC 


CTG 


AAG 


TGG 


TGG 


GAA 


CTG 


CGC 


GGC 


162 


GGA 


AGA 


GTT 


AAC 


TGC 


ATG 


CAG 


AAT 


AGT 


GAC 


TCT 


GAA 


TGT 


CCC 


CTG 


TCC 


CAC 


GAT 


216 


GGG 


TAC 


TGC 


CTC 


CAT 


GAT 


GGT 


GTG 


TGC 


ATG 


TAT 


ATT 


GAA 


GCA 


TTG 


GAC 


AAG 


TAT 


270 


GCA 


TGC 


AAC 


TGT 


GTT 


GTT 


GGC 


TAC 


ATC 


GGG 


GAG 


CGA 


TGT 


CAG 


TAC 


CGA 


GAC 


CTG 


324 


AAG 


TGG 


TGG 


GAA 


CTG 


CGC 


GGC 


GGA 


AGA 


GTT 


AAC 


TGC 


ATG 


CAG 


AAT 


AGT 


GAC 


TCT 


378 


GAA 


TGT 


CCC 


CTG 


TCC 


CAC 


GAT 


GGG 


TAC 


TGC 


CTC 


CAT 


GAT 


GGT 


GTG 


TGC 


ATG 


TAT 


4 32 


ATT 


GAA 


GCA 


TTG 


GAC 


AAG 


TAT 


GCA 


TGC 


AAC 


TGT 


GTT 


GTT 


GGC 


TAC 


ATC 


GGG 


GAG 


486 


CGA 


TGT 


CAG 


TAC 


CGA 


GAC 


CTG 


AAG 


TGG 


TGG 


GAA 


CTG 


CGC 


GGC 


GGA 


AGA 


GTT 


AAC 


540 


TGC 


ATG 


CAG 


AAT 


AGT 


GAC 


TCT 


GAA 


TGT 


CCC 


CTG 


TCC 


CAC 


GAT 


GGG 


TAC 


TGC 


CTC 


594 


CAT 


GAT 


GGT 


GTG 


TGC 


ATG 


TAT 


ATT 


GAA 


GCA 


TTG 


GAC 


AAG 


TAT 


GCA 


TGC 


AAC 


TGT 


648 


GTT 


GTT 


GGC 


TAC 


ATC 


GGG 


GAG 


CGA 


TGT 


CAG 


TAC 


CGA 


GAC 


CTG 


AAG 


TGG 


TGG 


GAA 


702 


CTG 


CGC 


GGC 


GGA 


AGA 


GTT 


AAC 


TGC 


ATG 


CAG 


AAT 


AGT 


GAC 


TCT 


GAA 


TGT 


CCC 


CTG 


756 


TCC 


CAC 


GAT 


GGG 


TAC 


TGC 


CTC 


CAT 


GAT 


GGT 


GTG 


TGC 


ATG 


TAT 


ATT 


GAA 


GCA 


TTG 


810 


GAC 


AAG 


TAT 


GCA 


TGC 


AAC 


TGT 


GTT 


GTT 


GGC 


TAC 


ATC 


GGG 


GAG 


CGA 


TGT 


CAG 


TAC 


864 


CGA 


GAC 


CTG 


AAG 


TGG 


TGG 


GAA 


CTG 


CGC 
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(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double strands 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: upstream enhancer (from -343 to 
-90 bp) of 35S promoter 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 253bp upstream of 35S promoter 

enhancer element 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: cauliflower mosaic virus (CaMV) 

(B) STRAIN: Cabb B-D 

(C) INDIVIDUAL ISOLATE: Z.Dai, et al 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE: 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/ SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME /KEY : 35S promoter B-domain enhancer 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: standard cloning 

(D) OTHER INFORMATION: B-domain of 35S promoter from EcoR V 
site to Hind II site (upstream enhancer region from -34 3 to -90 bp) 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 

(K) RELEVANT RESIDUES IN SEQ ID NO: 

FROM (position) TO (position) 

(xi| SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GTCAACATGG TGGAGCACGA CACACTTGTC TACTCCAAAA ATATCAAAGA TACAGTCTCA 60 
GAAGACCAAA GGGCAATTGA GACTTTTCAA CAAAGGGTAA TATCCGGAAA CCTCCTCGGA 120 
TTCCATTGCC CAGCTATCTG TCACTTTATT GTGAAGATAG TGGAAAAGGA AGGTGGCTCC 180 
TACAAATGCC ATCATTGCGA TAAAGGAAAG GCCATCGTTG AAGATGCCTC TGCCGACAGT 240 
GGTCCCAAAG ATGGACCCCC ACCCACGAGG AGCATCGTGG AAAAAGAAGA CGTTCCAACC 300 
ACGTCTTCAA AGCAAGTGGA TTGATGTGAT 330 
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(6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1441bp 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: double strands 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 5 1 -untranscription region of 
chl a/b binding protein 

(iii) HYPOTHETICAL: 
{iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 11 kb EcoR 1 fragment 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: whole plants 

(B) STRAIN: Arabidopsis 

(C) INDIVIDUAL ISOLATE: Ha et al 

(D) DEVELOPMENTAL STAGE: 30 day old seedlings 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: genomic DNA library 

(B) CLONE: lambda bAT1005 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/ SEGMENT : 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME /KEY: arabidopsis cabl gene promoter 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: cross-hybridization 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 

(K) RELEVANT RESIDUES IN SEQ ID NO: 

FROM (position) TO (position) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GAATTCATCA ACAAATTACT CCTCAATCAC ACTCCTATAG AAAACGGTTT AAGCTATCAT 60 
TACATGTCTA GTTGGTTTTA CTCAGCCCTA GAAGTGTTGT TTATTGCATC ACTTTCCACG 120 
AAGCACAATT TTTCTTTTTT ACAATCACTA GACCTCACAG GCTCACACAT ATGCTTTAGA 180 
GCACATTCTA AACTTTGAAC TATAAAAGCT GTTAACACTA AT AC ACT AT G CGTTCTTTTT 240 
TGCTCCAAAC ACTTTTGATC CATTATTAGG AGACACTCCA CTTAGAAAGA TTTTCTAATC 300 
CTTTGGTCAA CTAGGAAGTT CAAGGTTTTT CTAAACAGAA ATTCATTTCA CAAGTAATTT 360 
AATTTATAAG GAAATGAATA GAGAAATCAA ATCATTGAAG AACTACAAAA TATAGATTCA 420 
AGGTCAGGTC TAAGAAAATA TTCCTGAAGC TCAAAAAAGA GTTTTCCTCT CACATTATAG 480 
AATTGGCCTT TACTTCAACA TTTTCCCACC TATTCCACAT TTGGTCAGAA CATTTTTAAT 540 
TACTTGTGGA TCAATTTCCG GTTGAAATGG GTTTGGTGAA TATCCGGTTC AGTTATATGG 600 
TGGCCGTTGG AATTGGCTTA TTAGTTGTGG CCGTTGTTGA AGCCGTTGGT ATTGGTAAGG 660 
GAGAAGCAGA CTTGTGGCTA TGAGTCTATG ACCATGACTC GTGATTATGG AGCTGTCTTA 720 
TGACCCTGAC CATCACCTTG ATCTGGTGGA TTCCAATGTT TTCTTCTTCT TCTAATAAAA 78 0 
TATTATGGTC AATACAGGTG CTAATTAAGA TGGTAATAAT TTCTTATGTT TCTGTGGTAA 84 0 
AGTTTGATTC AATTCCGTAG TTTTAGATAA TCTTATTTCC ATACATAAAT TTTATAGTTT 900 
TATCTACTTT GTTCTTATGT TTTATCTCTA GCCAAGAGTT ATTATTATTA TCAGAAGAAG 960 
AAAAAAAAAA GAAGCATATA TACAAAAGGT TTAATAAAAT GT AT TAT AC A AGGCAATTAT 1020 
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CCAAATTTTT TTTGTTTTGG TTTACATTGA 
TCTATTCGTA TACGTGTCAC GTCATGAGTG 
GATATCTAAA ACACATATCA ATTGCGAATC 
AAACAAACAA TCTAAACCCC AAAAAAAATC 
GATATTTCAA GATAAGACAG TATTTAGATT 
ATACCAAACC ACCCATTTCT TGGCTTACAA 
GCACTACTCA ACCTTAATGG CCGCCTCAAC 
T 



-25- 

TGCTCTCAGG ATTTCATAAG GATAGAGAGA 1080 
GGTGTTTCGC CAATCCATGA AACGCACCTA 1140 
TGCGAAGTGC GAGCCATTAA CCACGTAAGC 1200 
TATGACTAGC CAATAGCAAC CTCAGAGATT 1260 
TCTGTATTAT ATATAGCGAA AATCGCATCA 1320 
CAACAAATCT TAAACGTTTT ACTTTGTGCT 1380 
AATGGCTCTC TCCTCCCCTG CCTTCGCCGG 1440 

1441 
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(7) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832bp 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: double strands 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: CaMV 35S 5 1 -untranscription 
upstream 

(iii) HYPOTHETICAL: 

(iv) ANTI- SENSE: 

(v) FRAGMENT TYPE: Alu 1 (from 7143bp) -EcoRl (to 7517bp) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: cauliflower mosaic virus 

(B) STRAIN: CM4-184 

(C) INDIVIDUAL ISOLATE: RJ Shepherd 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: genomic library of CM4-184 

(B) CLONE: pOS-1 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/ SEGMENT : 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME/KEY: CaMV 35S promoter 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: cross-hybridization 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 

(K) RELEVANT RESIDUES IN SEQ ID NO: 

FROM (position) TO (position) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CCCACA GATGGTTAGA GAGGCTTACG CAGCAGGTCT CATCAAGACG ATCTACCCGA 56 
GCAATAATCT CCAGGAAATC AAATACCTTC CCAAGAAGGT TAAAGATGCA GTCAAAAGAT 116 
TCAGGACTAA CTGCATCAAG AACACAGAGA AAGATATATT TCTCAAGATC AGAAGTACTA 176 
TTCCAGTATG GACGATTCAA GGCTTGCTTC ACAAACCAAG GCAAGTAATA GAGATTGGAG 236 
TCTCTAAAAA GGTAGTTCCC ACTGAATCAA AGGCCATGGA GTCAAAGATT CAAATAGAGG 296 
ACCTAACAGA ACTCGCCGTA AAGACTGGCG AACAGTTCAT ACAGAGTCTC TTACGACTCA 356 
ATGACAAGAA GAAAATCTTC GTCAACATGG TGGAGCACGA CACACTTGTC TACTCCAAAA 416 
ATATCAAAGA TACAGTCTCA GAAGACCAAA GGGCAATTGA GACTTTTCAA CAAAGGGTAA 476 
TATCCGGAAA CCTCCTCGGA TTCCATTGCC CAGCTATCTG TCACTTTATT GTGAAGATAG 536 
TGGAAAAGGA AGGTGGCTCC TACAAATGCC ATCATTGCGA TAAAGGAAAG GCCATCGTTG 596 
AAGATGCCTC TGCCGACAGT GGTCCCAAAG ATGGACCCCC ACCCACGAGG AGCATCGTGG 656 
AAAAAGAAGA CGTTCCAACC ACGTCTTCAA AGCAAGTGGA TTGATGTGAT ATCTCCACTG 716 
ACGTAAGGGA TGACGCACAA TCCCACTATC CTTCGCAAGA CCCTTCCTCT ATATAAGGAA 776 
GTTCATTTCA TTTGGAGAGA ACACGGGGGA CTCTAGAGGA TCCCCGGGTG GTCAGT 832 
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(8) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 473bp 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : double strands 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 5 • -untranscription upstream of 
ribosomal protein L34 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 1500bp BamH-Hind 111 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: tobacco NT1 cells 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: Z.Dai, et al 

(D) DEVELOPMENTAL STAGE: 3 days old 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(HJ CELL LINE: NT1 
(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: genomic library 

(B) CLONE: TSC 40 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME /SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME /KEY: RPL-34 promoter 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: plaque hybridization 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 

(K) RELEVANT RESIDUES IN SEQ ID NO: 

FROM (position) TO (position) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AGATCTCT CTTTGTATTC TTATTGATGT ACTGGTTTGA AGATGAATAA AATCTTTCAT 58 
•TCCACCAAAA AAAGAATGAA AATAAAATTT TAATATACAT GTTGATATAG ACAAAGAAGA 118 
AAAAAAAAGT TGTGATTACA TTTATTGACT ATTTGATGCC AATATCTATA ACTAGAGCTA 178 
TTTTCTATCA ATTATATGGG TATGTTGTTA TACCATGCCA AAACCTCAAT TCATAATGTG 238 
CTTGTTTAAA CCCAGTTTAA TGGGCTAACA TGTTGATGGG CTTATAGGCC CGTCTGATTT 298 
CCTTGCCAGA CACTAGTAAG TAAATGATTC TATCATCCAA TATCAACCGT GGGATCTAGG 358 
GCTTGTCCCA CTTATATACA CTACATATAT TTAACTTTCC TTTAGCCCTT CTGCTTCAGC 418 
CCCCAAAACA AAGAAAGAAG CTACAGAGAG AATAGCAGCG CCGCCGTGAA AAATG 473 
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(9) INFORMATION FOR SEQ ID NO: 8: 



(i) 



(ii) 



SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1162bp 

TYPE: Nucleic acid 
STRANDEDNESS: double strands 
TOPOLOGY: unknown 



<B) 
(C) 
(D) 

MOLECULE TYPE: 
(A) DESCRIPTION 



(iii) 

(iv) 

(v) 

(vi) 



S'-untranscription region of 35S gene 
from CaMV with 2 copies of B domains 

HYPOTHETICAL: 
ANTI-SENSE: 

FRAGMENT TYPE: 253bp Hindlll/EcoRV fragment + 343bp Hind 

11/EcoRl fragment 
ORIGINAL SOURCE: 

ORGANISM: whole cell 
STRAIN: CM4-184 

INDIVIDUAL ISOLATE: Z.Dai, et al 
DEVELOPMENTAL STAGE: 
HAPLOTYPE: 
TISSUE TYPE: 
CELL TYPE: 
CELL LINE: 
ORGANELLE: 
IMMEDIATE SOURCE: 

(A) LIBRARY: genomic library of CM4- 

(B) CLONE: POS-1 
(viii) POSITION IN GENOME: 

(A) CHROMOSOME/ SEGMENT : 
MAP POSITION: 
UNITS : 



(vii) 



(A) 
(B) 
(C) 
(D) 
(E) 
(F) 
(G) 
(H) 
(I) 



-184 



(ix) 



(B) 
(C) 

FEATURE: 
(A) 



NAME /KEY: 



(x) 



CaMV 35S promoter with duplication 
of upstream B domain 

LOCATION : 

IDENTIFICATION METHOD: 
OTHER INFORMATION: 
PUBLICATION INFORMATION: 
(A) AUTHORS: 
TITLE: 
JOURNAL: 
VOLUME : 
ISSUE: 
PAGES : 
DATE: 

DOCUMENT NUMBER: 
FILING DATE: 
PUBLICATION DATE: 
RELEVANT RESIDUES IN SEQ ID NO: 

FROM (position) TO (position) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



(B) 
(C) 
(D) 



(B) 
(C) 
(D) 
(E) 
(F) 
(G) 
(H) 
(I) 
(J) 
(K) 



8: 



CCC ACAGATGGTT AGAGAGGCTT ACGCAGCAGG TCTCATCAAG ACGATCTACC 
CGAGCAATAA TCTCCAGGAA ATCAAATACC TTCCCAAGAA GGTTAAAGAT GCAGTCAAAA 
GATTCAGGAC TAACTGCATC AAGAACACAG AGAAAGATAT ATTTCTCAAG ATCAGAAGTA 
CTATTCCAGT ATGGACGATT CAAGGCTTGC TTCACAAACC AAGGCAAGTA ATAGAGATTG 
GAGTCTCTAA AAAGGTAGTT CCCACTGAAT CAAAGGCCAT GGAGTCAAAG ATTCAAATAG 
AGGACCTAAC AGAACTCGCC GTAAAGACTG GCGAACAGTT CATACAGAGT CTCTTACGAC 
TCAATGACAA GAAGAAAATC TTCGTCAACA TGGTGGAGCA CGACACACTT GTCTACTCCA 
AAAATATCAA AGATACAGTC TCAGAAGACC AAAGGGCAAT TGAGACTTTT CAACAAAGGG 
TAATATCCGG AAACCTCCTC GGATTCCATT GCCCAGCTAT CTGTCACTTT ATTGTGAAGA 
TAGTGGAAAA GGAAGGTGGC TCCTACAAAT GCCATCATTG CGATAAAGGA AAGGCCATCG 
TTGAAGATGC CTCTGCCGAC AGTGGTCCCA AAGATGGACC CCCACCCACG AGGAGCATCG 
TGGAAAAAGA AGACGTTCCA ACCACGTCTT CAAAGCAAGT GGATTGATGT GATAACATGG 
TGGAGCACGA CACACTTGTC TACTCCAAAA ATATCAAAGA TACAGTCTCA GAAGACCAAA 
GGGCAATTGA GACTTTTCAA CAAAGGGTAA TATCCGGAAA CCTCCTCGGA TTCCATTGCC 
CAGCTATCTG TCACTTTATT GTGAAGATAG TGGAAAAGGA AGGTGGCTCC TACAAATGCC 
ATCATTGCGA TAAAGGAAAG GCCATCGTTG AAGATGCCTC TGCCGACAGT GGTCCCAAAG 



53 

113 

173 

233 

293 

353 

413 

473 

533 

593 

653 

713 

773 

833 

893 

953 
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ATGGACCCCC ACCCACGAGG AGCATCGTGG 
AGCAAGTGGA TTGATGTGAT ATCTCCACTG 
CTTCGCAAGA CCCTTCCTCT ATATAAGGAA 
CTCTAGAGGA TCCCCGGGTG GTCAGT 



-29- 

AAAAAGAAGA CGTTCCAACC ACGTCTTCAA 1013 
ACGTAAGGGA TGACGCACAA TCCCACTATC 1073 
GTTCATTTCA TTTGGAGAGA ACACGGGGGA 1133 

1159 
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CLAIMS 



We claim: 



1. 



A method of producing human growth factors from plant cells, 



5 comprising the steps of: 



(a) obtaining a positive transformant of the plant cells, the positive 



transformant carrying genetic material encoding the production of a human growth 
factor with a length of at least 200 amino acids; 



2. The method as recited in claim 1, wherein obtaining the positive 
transformant has the step of: 

modifying a chimeric cDNA encoding the human growth factor with a 
15 length of at least 200 amino acids, for subcloning into a plant expression vector. 

3. The method as recited in claim 2, further comprising the steps of: 

(a) subcloning the chimeric cDNA into the plant expression vector, 
and obtaining a subcloned plant expression vector; 
20 (b) transferring the subcioned plant expression vector into a 

plurality of plant cells; 

(c) selecting a plurality of positive transformants from the plurality of 
plant cells on an antibiotic selective media; 

(d) permitting growth of the portion of the plurality of plant cells in 
25 whole plants or suspensions; and 

(e) extracting a liquid containing the human growth factor from the 
plurality of transgenic plant cells. 



10 



(b) cultivating the positive transformant; and 

(c) obtaining the human growth factors. 



30 



4. The method as recited in claim 3, wherein transferring is by direct 
particle bombardment. 
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5. The method as recited in claim 3, wherein transferring is by 
Agrobacterium mediated transformation. 

6. The method as recited in claim 5, wherein Agrobacterium mediated 
5 transformation comprises the steps of: 

(a) placing the subcloned plant expression vector to an 
agrobacterium; 

(b) co-cultivating the Agrobacterium containing the subcloned 
plant expression vector with the plurality of plant cells. 

10 

7. The method as recited in claim 1, wherein the step of cultivating is 
with a whole plant. 

8. The method as recited in claim 1, wherein the step of cultivating is 
15 with a plant tissue culture. 

9. The method as recited in claim 1, wherein the step of obtaining is 
selected from the group consisting of ultrafiltration, affinity chromatography, and 
electrophoresis. 

20 

10. The method as recited in claim 1, wherein the length of at least 200 
amino acids is obtained by cloning a cDNA. 

11. The method as recited in claim 10, wherein said cDNA is a pre-pro- 
25 EGF cDNA. 

12. The method as recited in claim 1, wherein the length of at least 200 
amino acids is obtained by synthesizing a cDNA. 
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13. The method as recited in claim 12, wherein said synthesizing is 
concatomerizing multiple gene copies to obtain the length of at least 200 amino 
acids. 

5 14. The method as recited in claim 1, further comprising increasing an 

overall size of a gene to be expressed with a fusion construct encoding an hEGF 
linked to a protein that is efficiently produced in plant systems. 

15. The method as recited in claim 1, wherein said human growth factor 
10 is selected from the group consisting of epidermal growth factor (EGF), 

transforming growth factor (TGF), vascular endothelial growth factor (VEGF), 
platelet-derived growth factor (PDGF), fibroblast growth factor (FGF), tumor 
necrosis factor (TNF), heparin-binding epidermal growth factor (HBEGF), insulin- 
like growth factor (ILGF), platelet-derived endothelial cell growth factor 
15 (PDECGF), platelet-derived angiogenesis factor (PDAF), and bone-and-cartilage 
inducing growth factor (BCDF) and combinations thereof. 

16. The method as recited in claim 2, wherein modifying is by adding a 
regulatory element selected from the group consisting of leader sequences, signal 

20 peptides, transcription promoters or enhancers, and transcription terminators. 

17. The method as recited in claim 2, wherein modifying a chimeric 
cDNA, comprises the steps of: 

(a) adding said transcription promoter to the upstream or 5' end of 
25 the chimeric cDNA; and 

(b) adding said transcription terminator to the downstream or 3' 

end of the chimeric cDNA. 

18. The method as recited in claim 17, further comprising adding an 

30 additional regulatory element encoding a signal peptide, said additional regulatory 
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element added between the transcription promoter and the upstream 5' end of the 
chimeric cDNA. 

19. The method as recited in claim 18, farther comprising adding a 
5 regulatory element between the transcription promoter and the additional 

regulatory element encoding the signal peptide to enhance mRNA stability. 

20. The method as recited in claim 18, further comprising adding a 
regulatory element at the downstream or 3' end of the chimeric cDNA to enhance 

10 mRNA stability. 

21. The method as recited in claim 17, wherein transcription promoters 
limit growth factors production to a non-crop portion of a transgenic whole plant. 

15 22. The method as recited in claim 21, wherein the transcription promoters 

are selected from the group consisting of an upstream enhancer region (-343 to -90 
bp) of a CaMV 35S promoter, a chlorophyll a/b binding promoter (cab 1) and 
combinations thereof. 

20 23. The method as recited in claim 17, wherein the transcription promoters 

are selected from the group consisting of a modified 35S promoter, TSC29 
promoter, TSC40 promoter and combinations thereof. 

24. The method as recited in claim 23, wherein the modified 35S 

25 promoter is a 35S promoter modified by duplicating an upstream enhancer region 
(-343 to -90 bp) of the 35S promoter to increase transcription activity. 

25. The method as recited in claim 2, wherein said cDNA is a pre-pro- 
EGF cDNA. 



30 
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26. The method as recited in claim 25, wherein said pre-pro-EGF cDNA 
has approximately 4.5 kb, whereby overall titers of active hEGF in both whole 
plants and cell culture are increased. 

5 27. The method as recited in claim 2, wherein the length of at least 200 

amino acids is obtained by synthesizing the cDNA. 

28. The method as recited in claim 27, wherein said synthesizing is 
concatomerizing multiple gene copies to obtain the length of at least 200 amino 

10 acids. 

29. The method as recited in claim 28, wherein said multiple gene copies 
are an oligomeric polypeptide having of repeated hEGF domains. 
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30. The method as recited in claim 2, further comprising increasing an 
overall size of a gene to be expressed with a fusion construct encoding an hEGF 
linked to a protein that is efficiendy produced in plant systems. 
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